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DETAILED ACTION 

1 . This is responsive to the RCE filed on 10 November 2008. 

2. Claims 1, 2, 6, 7, 9, 14, and 15 are pending and considered below. All the 
independent claims (1, 7, and 15) have been amended. 

Response to Amendment 

3. The claim objection of claim 1 is withdrawn in view of Applicant's amendments. 

Response to Arguments 

4. Applicant's arguments with respect to claims 1, 2, 6, 7, 9, 14, and 15 have been 
considered but are moot in view of the new ground(s) of rejection. 

Claim Objections 

5. Claims 7, 14, and 15 are objected to because of the following informalities: in line 
1 5 of claim 7 and line 1 7 of claim 1 5, the limitation "determining if the phone includes a 
voice file" is recited. There is insufficient antecedent basis for "the phone" in the claims. 
The Examiner will interpret the limitation as 'determining if a phone includes a voice file'. 
Also, in line 17 of claim 7 and line 19 of claim 15, the limitation "corresponding to the 
first collection" is recited. There is insufficient antecedent basis for "the first collection" 
in the claims. The Examiner will interpret the limitation as 'corresponding to the selected 
collection'. Further, in line 1, claim 14 recites "the cellular phone", which does not have 
sufficient antecedent basis in the claim, and which will be interpreted as 'the phone'. 
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Claim Rejections - 35 USC § 103 

6. The following is a quotation of 35 U.S.C. 1 03(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

7. Claims 1, 6, 7, 9, 14, and 15 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Wu et al (US PGPub 2003/0061048) in view of Ostermann et al 
(USPN 6,976,082). 

Claim 1: 

Wu discloses a system for generating a collection of speech generation 
commands associated with computer readable information (Abstract), comprising: 

a first computer (network server) configured to generate a first collection of 
speech generation commands (coded speech parameters) based on a first portion of 
computer readable information (text) (see [0019]); 

the first computer in communication with a communication network and a phone 
operatively communicating with the communication network, wherein signals generated 
by the first computer are transmitted through the communication network to the phone 
("transmitting the coded speech parameters from a network server to a wireless 
communication device", [0019]). 

Wu further discloses either the phone receiving the first collection of speech 
generation commands and accessing a predetermined set of the speech samples in the 
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voice file based on the first collection of speech generation commands to generate 
auditory speech ("the native coded speech parameters, corresponding to each of the 
phonics from the previous step and along with suitable spaces, are subsequently 
processed in a signal processor 208 (such as a DSP for example) to provide a 
decompressed speech signal to an audio circuit 210 of the cellular phone handset", 
[0018]) or the phone receiving signals corresponding to auditory speech and generating 
auditory speech from the received signals ("Alternatively, a network server of the 
communication system can converts this formatted text string to speech and transmit 
this speech to a conventional cellular handset over a voice channel instead of a data 
channel", [0011]). In other words, Wu discloses, either receiving textual information in 
the form of coded speech parameters and performing a text-to-speech process at the 
phone or, performing the text-to-speech (TTS) process at a server and transmitting 
speech to the phone. 

However, Wu does not explicitly disclose determining whether the phone 
includes a voice file (i.e. is able to perform text-to-speech which require voice files for 
concatenation based TTS) and conducting the text-to-speech, either at the phone if a 
voice file is present at the phone or, at the server if a voice file is not present on the 
phone. 

In a similar network based text-to-speech system, Ostermann discloses checking 
if a phone (col. 6, lines 5-11) has speech synthesis software (which require voice files 
for concatenation based TTS) and performing the TTS at the phone if the phone has 
TTS capabilities or performing the TTS at a server and transmitting synthesized speech 
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to the device from the server if the device does not have speech synthesis software 
(col. 11, lines 15-26). 

It would have been obvious to one with ordinary skill in the art at the time of the 
invention to check Wu's phone for TTS capabilities (voice files in concatenation based 
TTS) and performing TTS on the phone, if the phone has voice files, or performing the 
TTS on a server and transmitting synthesized speech, if the phone does not have TTS 
capabilities, because a phone cannot perform TTS if it does not have TTS capabilities 
(voice files in concatenation based TTS). 

Claim 6: 

Wu and Ostermann disclose the system of claim 1 , Wu further discloses wherein 
the first computer further includes a memory having a voice file stored therein, the voice 
file having a plurality of speech samples from a predetermined person, the first 
collection of speech generation commands being associated with a predetermined set 
of the plurality of speech samples (Fig. 2, element 202 and related text). 

Claim 7: 

Wu discloses a method for generating a collection of speech generation 
commands (Abstract), comprising: 

generating a first collection of speech generation commands (coded speech 
parameters) based on a first portion of computer readable information (text message) in 
a first computer (Fig. 1, step 108 and related text); 

wherein the first computer includes a memory having a voice file stored therein, 
the voice file having a plurality of speech generation commands associated with speech 
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samples of a person (Fig. 2, element 202 and related text), wherein the generation of 
the first collection of speech generation commands includes: 

generating phonetic units (phonics) associated with the first portion of computer 
readable information (text message) (Fig. 1, item 106 and related text); 

comparing a phonetic unit to phonetic units stored in the voice file (code table, 
Fig. 2, element 202 and related text) to determine a matched phonetic unit; and 
selecting a speech generation command in the voice file associated with the matched 
phonetic unit (Fig. 1 , step 108 and related text). 

Wu does not explicitly disclose that the phonetic units associated with the text 
message and the phonetic units stored in the code table are composed of phonemes 
and multi-phonemes. 

However, in the Background of The Invention, Wu discloses that phonemes 
(phones) and multi-phonemes (diphones) are used as phonetic units ([0004]). 

It would have been obvious to one with ordinary skill in the art at the time of the 
invention to represent Wu's phonetic units using phonemes and multi-phonemes 
because they are well known standards in text-to-speech systems. 

Wu further discloses either the phone receiving the first collection of speech 
generation commands and accessing a predetermined set of the speech samples in the 
voice file based on the first collection of speech generation commands to generate 
auditory speech ("the native coded speech parameters, corresponding to each of the 
phonics from the previous step and along with suitable spaces, are subsequently 
processed in a signal processor 208 (such as a DSP for example) to provide a 
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decompressed speech signal to an audio circuit 210 of the cellular phone handset", 
[0018]) or the phone receiving signals corresponding to auditory speech and generating 
auditory speech from the received signals ("Alternatively, a network server of the 
communication system can converts this formatted text string to speech and transmit 
this speech to a conventional cellular handset over a voice channel instead of a data 
channel", [0011]). In other words, Wu discloses, either receiving textual information in 
the form of coded speech parameters and performing a text-to-speech process at the 
phone or, performing the text-to-speech (TTS) process at a server and transmitting 
speech to the phone. 

However, Wu does not explicitly disclose determining whether the phone 
includes a voice file (i.e. is able to perform text-to-speech which require voice files for 
concatenation based TTS) and conducting the text-to-speech, either at the phone if a 
voice file is present at the phone or, at the server if a voice file is not present on the 
phone. 

In a similar network based text-to-speech system, Ostermann discloses checking 
if a phone (col. 6, lines 5-11) has speech synthesis software (which require voice files 
for concatenation based TTS) and performing the TTS at the phone if the phone has 
TTS capabilities or performing the TTS at a server and transmitting synthesized speech 
to the device from the server if the device does not have speech synthesis software 
(col. 11, lines 15-26). 

It would have been obvious to one with ordinary skill in the art at the time of the 
invention to check Wu's phone for TTS capabilities (voice files in concatenation based 
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TTS) and performing TTS on the phone, if the phone has voice files, or performing the 
TTS on a server and transmitting synthesized speech, if the phone does not have TTS 
capabilities, because a phone cannot perform TTS if it does not have TTS capabilities 
(voice files in concatenation based TTS). 
Claim 9: 

Wu and Ostermann disclose the method of claim 7, Wu further discloses wherein 
the comparing of a phoneme or multi-phoneme to phonemes and multi-phonemes 
stored in the voice file to determine a matched phoneme or multi-phoneme includes: 

comparing a multi-phoneme to multi-phonemes stored in the voice file; and, 
comparing a phoneme to phonemes stored in the voice file ("mapping each of the 
phonics from the audio server, by a mapping unit 206, against the code table 202 to find 
the coded speech parameters corresponding to each of the phonics", [0015]). 

Claim 14: 

Wu and Ostermann disclose the method of claim 13, Wu further discloses 
wherein the phone includes a memory having a voice file (audio file) stored therein, the 
method further comprising accessing portions of the voice file based on the first 
collections of speech generation commands to generate auditory speech ("the native 
coded speech parameters, corresponding to each of the phonics from the previous step 
and along with suitable spaces, are subsequently processed in a signal processor 208 
(such as a DSP for example) to provide a decompressed speech signal to an audio 
circuit 210 of the cellular phone handset", [001 8]). 

Claim 15: 
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Claim 15 is directed to a storage medium encoding program code for performing 
the method of claim 7. It is old and well-known to encode program code for performing a 
method on a computer storage medium and implement instructions corresponding to 
the program code on a computer's processor. Accordingly, claim 15 is rejected with the 
same rationale as applied above with respect to method claim 7. 

8. Claim 2 is rejected under 35 U.S.C. 103(a) as being unpatentable over Wu et al 
(US PGPub 2003/0061048) in view of Ostermann et al (USPN 6,976,082) and in further 
view of Walker et al (US PGPub 2001/0047260). 
Claim 2: 

Wu and Ostermann disclose the system of claim 1 but they do not explicitly 
disclose a second computer configured to receive the second portion of computer 
readable information from the first computer and to generate a second collection of 
speech generation commands based on the second portion of computer readable 
information, the first computer is further configured to receive the second collection of 
speech generation commands from the second computer and to generate a third 
collection of speech generation commands based on the first and second collection of 
speech generating commands, wherein the first computer generates signals based on 
the third collection of speech generation commands 

In a similar text-to-speech system, Walker discloses a second computer (item 
22b, Fig. 2) configured to receive the second portion of computer readable information 
from a first computer and to generate a second collection of speech generation 
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commands based on the second portion of computer readable information (Fig. 2, item 
22 and related text), the first computer is further configured to receive the second 
collection of speech generation commands from the second computer and to generate a 
third collection of speech generation commands based on the first and second 
collection of speech generating commands (Fig. 2, item 24 and related text, [0030]); 
wherein the first computer generates signals based on the third collection of speech 
generation commands ("Streaming buffer 24 transmits the speech segments in the 
proper order along with the telephony user address to voice application", [0031]). 

It would have been obvious to one with ordinary skill in the art at the time of the 
invention to perform, in Wu's system, the text-to-speech process using a plurality of 
engines because the resulting system "efficiently processes text documents of any size" 
(Walker, [0018]) by dividing the text into easily manageable portions. 

Conclusion 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to SAMUEL G. NEWAY whose telephone number is 
(571)270-1058. The examiner can normally be reached on Monday - Friday 8:30AM - 
5:30PM EST. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, David R Hudspeth can be reached on 571-272-7843. The fax phone 
number for the organization where this application or proceeding is assigned is 571- 
273-8300. 
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Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a 
USPTO Customer Service Representative or access to the automated information 
system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 

/David R Hudspeth/ 

Supervisory Patent Examiner, Art Unit 2626 

IS. G. N./ 

Examiner, Art Unit 2626 



